rancn 



Verified by: 



F Errors Corrected by the STIC^ptems Brancn -hi 
firtal Number': 0^^2 99 ft _ SfJ^^ 8 " 

□ 

□ 

□ 



Changed a fife 'from non-ASCII to ASCII 

Changed the margins in cases where the sequence text was \vrapped" c 
Edited a format error in the Current Application Data section, specrficaJly 



IC sta: 




Edited the Current Application Data section with the actuaJ current number. The numtep inputted D^nthe 
applicant was O the prior application data; or O other . 

Added the mandatory heading and subheadings for 'Current Application Data". 

Edited the "Number of Sequences" field. The applicant spelled out a number'instead of using an integer. 
Changed the spelling of a mandatory field (the headings or subheadings), specifically: 



Corrected the SEQ ID NO when obviously incorrect. The sequence numbers that were edited were: 



Inserted or corrected a nucleic number at the end of a nucleic lino. SEQ ID NO's edited: 



Corrected subheading placement. All responses must be on the same line as each subheading. If the 
applicant placed a response below the subheading, this was moved to its appropriate place. 

Inserted colons after headings/subheadings. Headings edited included: 



Deleted extra, invalid, headings used by an applicant, specrficaJly: 



■7< 



Deleted: □ non-ASCII "garbage" at the beginning/end of files; □ secretary initials/filename at end of file; 
□ page numbers throughout text; □ other invalid text, such as . 



inserted mandatory headings, specifically; 



# 2 V 



Corrected an obvious error in the response, specifically; 



Edited identifiers where upper case is used but lower case is required, or vice versa. 
Corrected an error in the Number of Sequences field, specifically: 



A "Hard Page Break" code was inserted by the applicant. All occurrences had to be deleted. 

Deleted ending stop codon in amino acid sequences and adjusted the "(A)Length:' field accordingly (error 
due to a Patentln bug). Sequences corrected: . 

Other: 



'Examiner: The above corrections must be communicated to the applicant in the first Office 

Actfon. DO NOT send a copy of this form. 3/1/95 



1 



Page 1 of 7 



OIPE 



RAW SEQUENCE LISTING DATE: 06/29/2001 

PATENT APPLICATION: US/09/751 , 299A TIME: 14:23:08 

Input Set : A:\DIVER1440-2final.txt 
Output Set: N:\CRF3\06292001\l751299A.raw 

3 <110> APPLICANT: Madden, Mark 

4 Weiner, David P. 

5 Chaplin, Jennifer A. 

7 <120> TITLE OF INVENTION: METHODS FOR PRODUCING ENANT I OMERICALLY PURE 

8 ALPHA-SUBSTITUTED CARBOXYLIC ACIDS 
10 <130> FILE REFERENCE: DIVER1440-2 

12 <140> CURRENT APPLICATION NUMBER: US 09/751, 299A 

13 <141> CURRENT FILING DATE: 2000-12-28 

15 <150> PRIOR APPLICATION NUMBER: 60/254,414 

16 <151> PRIOR FILING DATE: 2000-12-07 

18 <150> PRIOR APPLICATION NUMBER: 60/173,609 

19 <151> PRIOR FILING DATE: 1999-12-29 
21 <160> NUMBER OF SEQ ID NOS : 4 

23 <170> SOFTWARE: Patent In Ver. 2.1 

25 <210> SEQ ID NO: 1 

26 <211> LENGTH: 1041 

27 <212> TYPE: DNA 

28 <213> ORGANISM: Unknown Organism 

30 <220> FEATURE: 

31 <223> OTHER INFORMATION: Description of Unknown Organism: Obtained from an 

32 environmental sample 

34 <220> FEATURE: 

35 <221> NAME /KEY: CDS 

36 <222> LOCATION: (1)..(1041) 
38 <4 00> SEQUENCE: 1 



39 


atg 


teg 


gag 


ccc 


atg 


acg 


aag 


tat 


cgc 


ggc 


gcg 


gcg 


gtg 


cag 


gee 


gcg 


48 


40 


Met 


Ser 


Glu 


Pro 


Met 


Thr 


Lys 


Tyr 


Arg 


Gly Ala 


Ala 


Val 


Gin 


Ala 


Ala 




41 


1 








5 










10 










15 






43 


ccg 


gtg 


ttc 


etc 


gat 


etc 


gac 


cgc 


aca 


gtc 


gag 


aaa 


gcg 


ate 


ggc 


ctg 


96 


44 


Pro 


Vai 


Phe 


Leu 


Asp 


Leu 


Asp 


Arg 


Thr 


Val 


Glu 


Lys 


Ala 


He 


Gly 


Leu 




45 








20 










25 










30 








47 


ate 


gag 


cag 


gcg 


gee 


aag 


cag 


gac 


gtg 


cgc 


ctg 


ate 


gca 


ttc 


cca 


gag 


144 


48 


He 


Glu 


Gin 


Ala 


Ala 


Lys 


Gin 


Asp 


Val 


Arg 


Leu 


He 


Ala 


Phe 


Pro 


Glu 




49 






35 










40 










45 










51 


act 


tgg 


att 


ccc 


ggc 


tat 


ccc 


ttt 


tgg 


ata 


tgg 


ctg 


ggc 


gcg 


ccg 


get 


192 


52 


Thr 


Trp 


He 


Pro 


Gly 


Tyr 


Pro 


Phe 


Trp 


He 


Trp 


Leu 


Gly 


Ala 


Pro 


Ala 




53 




50 










55 










60 












55 


tgg 


ggc 


atg 


cgc 


ttc 


gtc 


cag 


cgc 


tat 


ttc 


gag 


aat 


teg 


etc 


gtg 


cgc 


240 


56 


Trp 


Gly 


Met 


Arg 


Phe 


Val 


Gin 


Arg 


Tyr 


Phe 


Glu 


Asn 


Ser 


Leu 


Val 


Arg 




57 


65 










70 










75 










80 




59 


ggc 


age 


aag 


cag 


tgg 


cag 


gee 


ctg 


gcg 


gat 


gcg 


gec 


cgc 


cgc 


cac 


ggc 


288 


60 


Gly 


Ser 


Lys 


Gin 


Trp 


Gin 


Ala 


Leu 


Ala 


Asp 


Ala 


Ala 


Arg 


Arg 


His 


Gly 




61 










85 










90 










95 






63 


atg 


cat 


gtc 


gtg 


gee 


ggc 


tat 


age 


gag 


cgc 


gcg 


ggc 


ggc 


age 


etc 


tat 


336 


64 


Met 


His 


Val 


Val 


Ala 


Gly 


Tyr 


Ser 


Glu 


Arg 


Ala 


Gly 


Gly 


Ser 


Leu 


Tyr 




65 








100 










105 










110 








67 


atg 


ggc 


cag 


gcg 


ate 


ttc 


ggc 


ccc 


gat 


ggc 


gat 


ctg 


ate 


gec 


gcg 


cgc 


384 



file://C:\Crf3\Outhold\VsrI75 1 299A.htm 



6/29/01 



Page 2 of 7 

t 

i 



RAW SEQUENCE LISTING DATE : 06/29/2001 

PATENT APPLICATION: US/09/751 , 299A TIME: 14:23:08 

Input Set : A:\DIVER1440-2final.txt 
Output Set: N:\CRF3\06292001\l751299A.raw 



68 


Met 


Gly 


Gin 


Ala 


He 


Phe 


Gly 


Pro 


Asp 


Gly 


Asp 


Leu 


He 


Ala 


Ala 


Arg 




69 






115 










120 










125 










71 


CQC 


aag 


etc 


aag 


cct 


acc 


cat 


qcq 

3 3 


qaq 

3 3 


CQC 

3 


acc 


qtq 

3 3 


ttc 


qqc 

3 3 


qaq 

3 3 


qqa 


432 


72 


Arq 


Lys 


Leu 


Lys 


Pro 


Thr 


His 


Ala 


Glu 


Arg 


Thr 


Val 


Phe 


Gly 


Glu 


Gly 




73 




130 










135 










140 












75 


gac 


QQC 

3 3 w 


age 


cat 


etc 


qcq 


qtq 


cac 


gat 


acc 


qcc 


ate 


qqq 

3 3 3 


cqc 


etc 


qqc 

3 3^ 


480 


76 


Asp 


Gly 


Ser 


His 


Leu 


Ala 


Val 


His 


Asp 


Thr 


Ala 


He 


Gly 


Arg 


Leu 


Gly 




77 


145 










150 










155 










160 




79 


QCQ 

3 ^** 3 


etc 


tqt 


t gc 


tqq 


qaq 

3 *-* 3 


cac 


ate 


cag 


cca 


ttg 


tea 


aaa 


tac 


gee 


atg 


528 


80 


Ala 


Leu 


Cys 


Cys 


Trp 

It 


Glu 


His 


He 


Gin 


Pro 


Leu 


Ser 


Lys 


Tyr 


Ala 


Met 




81 










165 










170 










175 






83 


tac 


gec 


gee 


aac 

l-J W V- 


gaa 


cag 


ate 


cac 


ate 


aca 

3 ^ 3 


teg 


taa 


ccg 


age 


ttc 


aac 

«H-* V-i V 


576 


84 


Tyr 


Ala 


Ala 


Asp 


Glu 


Gin 


Val 


His 


Val 


Ala 


Ser 


Trp 


Pro 


Ser 


Phe 


Ser 




85 








180 










185 










190 








87 


etc 


tat 


cgc 


aac 

3 3 


atg 


qcc 

3 ^ ^ 


tat 


acq 

3 3 


etc 


aaa 

3 3 1-4 


ccg 


aaa 

3 " 3 


gtc 


aat 


acc 


gee 


624 


88 


Leu 


Tyr 


Arg 


Gly 


Met 


Ala 


Tyr 


Ala 


Leu 


Gly 

Jr 


Pro 


Glu 


Val 


Asn 


Thr 


Ala 




89 






195 










200 










205 










91 


aca 


aac 


caa 


ate 


tac 


aca 

3 ^ 3 


gtc 


aaa 

3^3 


aac 

3 3^" 


aac 

3 3^" 


tac 


tac 


at a 

3^3 


ctg 


aca 

3 ^3 


tea 


672 


92 


Ala 


Ser 


Gin 


He 


Tyr 


Ala 


Val 


Glu 


Gly 


Gly 


Cys 


Tyr 


Val 


Leu 


Ala 


Ser 




93 




210 

.j. 










215 










220 












95 


tqc 


qcq 


ace 


gtt 


teg 


ccq 


qaq 


atg 


ate 


aag 


qt a 


ttq 


qtq 

3 3 


gat 


acg 


ccc 


720 


96 


Cys 


Ala 


Thr 


Val 


Ser 


Pro 


Glu 


Met 


He 


Lys 


Val 


Leu 


Val 


Asp 


Thr 


Pro 




97 


225 










230 










235 










240 




99 


qac 


aag 


qaq 

3 3 


atg 


ttc 


etc 


aag 


qcc 


qqc 

3 3 


qqc 

3 3 


qqt 

3 3 


ttt 


qcc 


atg 


att 


ttc 


768 


100 


Asp 


Lys 


Glu 


Met 


Phe 


Leu 


Lys 


Ala 


Gly 


Gly 


Gly 


Phe 


Ala 


Met 


He 


Phe 




101 










245 










250 










255 






103 


3 3 3 


ccc 


gac 


ggc 


cgc 


qcc 


ctq 


qcc 


qaq 

3 3 


ccq 


etc 


ccq 


qaq 

3 3 


acc 


gaa 


qaq 

3 3 


816 


104 


Gly 


Pro 


Asp 


Gly Arg 


Ala 


Leu 


Ala 


Glu 


Pro 


Leu 


Pro 


Glu 


Thr 


Glu 


Glu 




105 








260 










265 










270 








107 


qqa 


ctg 


ctg 


gtc 


gee 


gat 


ate 


gac 


etc 


qqc 

3 3 


atg 


ate 


qcq 

3^3 


ttg 


gee 


aag 


864 


108 


Gly 


Leu 


Leu 


Val 


Ala 


Asp 


He 


Asp 


Leu 


Gly 


Met 


He 


Ala 


Leu 


Ala 


Lys 




109 






275 










280 










285 










111 


aca 
^ ^ y 


aca 


gee 


gat 


ccg 


aca 


aac 

3 3 ^ 


cac 


tat 


tea 


caa 


ccc 


gac 


gt a 


acg 


caa 


912 


112 


Ala 


Ala 


Ala 


Asp 


Pro 


Ala 


Glv 


His 


Tyr 


Ser 


Arg 


Pro 


Asp 


Val 


Thr 


Arg 




113 




290 










295 










300 












115 


ctg 


ctg 


ctg 


gat 


cga 


cat 


cca 

V** V^F U 


acc 


caa 

V*i* KJL (wit 


cac 

Vw \*A 


ate 


at c 


aca 

\*t. x** ^4 


ctt 


aat 


acc 


960 

~S V/ w 


116 


Leu 


Leu 


Leu 


Asp 


Arg 


Arg 


Pro 


Ala 


Gin 


Arg 


Val 


Val 


Thr 


Leu 


Asp 


Ala 




117 

XX/ 


305 










310 










ji j 










o ^ VJ 




119 


gca 


ttc 


gaa 


ccg 


caa 


aac 


gag 


gac 


aag 


ggc 


gac 


gcg 


ccc 


gcg 


ctg 


cgc 


1008 


120 


Ala 


Phe 


Glu 


Pro 


Gin 


Asn 


Glu 


Asp 


Lys 


Gly 


Asp 


Ala 


Pro 


Ala 


Leu 


Arg 




121 










325 










330 










335 






123 


gtg 


gtg 


gcg 


gaa 


age 


gee 


gee 


gee 


gcg 


cag 


tag 












1041 


124 


Val 


Val 


Ala 


Glu 


Ser 


Ala 


Ala 


Ala 


Ala 


Gin 
















125 








340 










345 


















128 


<210> SEQ ID NO 


: 2 


























129 


<211> LENGTH: 346 


























130 


<212> TYPE: 


PRT 




























131 


<213> ORGANISM: 


Unknown 


Organism 


















W— > 132 


<220> FEATURE: 





























file://C:\Crfi\Outhold\VsrI751299A.htm 



6/29/01 



I 



f 



Page 3 of 7 



RAW SEQUENCE LISTING DATE: 06/29/2001 

PATENT APPLICATION: US/09/751 , 2 99A TIME: 14:23:08 

Input Set : A:\DIVER1440-2final.txt 
Output Set: N:\CRF3\06292001\l751299A.raw 



132 

-i<- ij a* 


<223> OTHER 


INFORMATION 


: Descriptio 


135 


<4 00> SEQUENCE: 


2 










136 


Met 


Ser 


Glu 


Pro 


Met 


Thr 


Lys 


Tyr 


Arg 


137 


1 








5 










138 

J. *-j \j 


Pro 


Val 


Phe 


Leu 


Asp 


Leu 


Asp 


Arg 


Thr 


139 








20 










25 


14 0 

X T V/ 


lie 


Glu 


Gin 


Ala 


Ala 


Lys 


Gin 


Asp 


Val 


141 






35 










40 




142 

X T t- 


Thr 


Trp 


lie 


Pro 


Gly 


Tyr 


Pro 


Phe 


Trp 


143 




50 










55 






14 4 

X ~ a 


Trp 


Glv 


Met 


Arg 


Phe 


Val 


Gin 


Arg 


Tyr 


145 


65 










70 








J. 1 u 


Glv 


Ser 


Lvs 


Gin 


Trp 


Gin 


Ala 


Leu 


Ala 


147 

XT r 










85 










14 8 

X 1 u 


Met 


His 


Val 


Val 


Ala 


Gly 


Tyr 


Ser 


Glu 


14 9 








100 










105 


1 SO 

X *J \J 


Met 


Glv 


Gin 


Ala 


He 


Phe 


Gly 


Pro 


Asp 


151 

_L _L 






115 










120 




X *J 


Arg 


Lys 


Leu 


Lvs 


Pro 


Thr 


His 


Ala 


Glu 


X *J 




130 










135 






154 


Asp 


Glv 


Ser 


His 


Leu 


Ala 


Val 


His 


Asp 


155 


145 










150 








156 


Ala 


Leu 


Cys 


Cys 


Trp 


Glu 


His 


He 


Gin 


157 










165 










158 

X ^ (J 


Tvr 


Ala 


Ala 


Asp 


Glu 


Gin 


Val 


His 


Val 


159 








180 










185 


1 60 


Leu 


Tvr 


Arg 


Gly 


Met 


Ala 


Tyr 


Ala 


Leu 


161 

J. W -L. 






195 










200 




162 


Ala 


Ser 


Gin 


He 


Tyr 


Ala 


Val 


Glu 


Gly 


163 




210 










215 






164 


Cys. 


Ala 


Thr 


Val 


Ser 


Pro 


Glu 


Met 


He 


165 

X \J <u/ 


225 










230 








166 

JL U U 


Asp 


Lys 


Glu 


Met 


Phe 


Leu 


Lys 


Ala 


Gly 


167 

X \J 1 










245 










168 

X \J \J 


Gly 


Pro 


Asp 


Gly Arg 


Ala 


Leu 


Ala 


Glu 


169 








260 










265 


170 

X / \J 


Gly 


Leu 


Leu 


Val 


Ala 




T 1 p 


A 

no ks 




1 "7 1 
1/1 






275 










280 




172 


Ala 


Ala 


Ala 


Asp 


Pro 


Ala 


Gly 


His 


Tyr 


173 




290 










295 






174 


Leu 


Leu 


Leu 


Asp 


Arg 


Arg 


Pro 


Ala 


Gin 


175 


305 










310 








176 


Ala 


Phe 


Glu 


Pro 


Gin 


Asn 


Glu 


Asp 


Lys 


177 










325 










178 


Val 


Val 


Ala 


Glu 


Ser 


Ala 


Ala 


Ala 


Ala 


179 








340 










345 


183 


<210> SEQ ID NO: 


3 










184 


<211> LENGTH: 1014 










185 


<212> TYPE: 


DNA 













of Unknown Organism: Obtained from an 



Glv 

oxy 


Ala 


Ala 


Va 1 

V CL X 


v_> XII 


Al a 

n x ci 


Al a 

ni ci 


10 










15 

X 




Val 


Glu 


T ,\7 Q 


Ala 


TIp 
lie 


fll \7 


i_i t; Li 










30 






Arg 


Leu 


T 1 P 

-L -L C 


Al a 


Php 


pro 

tr x \J 


r;i n 

w -L Li 








45 








He 


Trp 


T .pii 


Gly 


Ala 


Drn 

XT x, \J 


A 1 A 
^i_L ci 






60 










Phe 


Glu 




Ser 


Leu 


v ex. J. 


A rrr 
rt x y 




75 












Asp 


Ala 


Al a 


Arg 


Arg 


Hi c; 

11 X. o 


ftl \7 


90 














Arg 


Ala 




Gly 


Ser 


T .01 i 


T\/r* 

j. yx 










110 






Gly Asp 


Jjcu 


He 


Ala 


Al a 


rtx y 








125 








Arg 


Thr 


v a. x 


Phe 


Gly 


O -L Li 


HI V 






14 0 

X *3 \J 










Thr 


Ala 


T 1 p 


Gly Arg 


T >P1 1 


Cl v 
<oxy 




155 










i fin 


Pro 


Leu 


C X 


Lys 


Tyr 


Ala 

ilX Cl 


Mpr 

nc L 


170 










175 




Ala 


Ser 


Trn 


Pro 


Ser 


Php 


O G x 










190 






Gly 


Pro 


fll n 

Ox Li 


Val 


Asn 


Thr* 


r\ x cl 








205 








Gly 


Cys 


T\7T 


Val 


Leu 


Al a 

/i X Cl 


Qpr 
O Cl 






220 

L V 










Lys 


Val 


T .PIT 

J_l ~ KJL 


Val 


Asp 


Thr 


Prn 

X X w 




235 










d. 4 U 


Gly 


Gly 


Phe 


Ala 


Met 


He 


Phe 


250 










255 




Pro 


Leu 


Pro 


Glu 


Thr 


Glu 


Glu 










270 






Gly 


Met 


He 


Ala 


Leu 


Ala 


Lys 








285 








Ser 


Arg 


Pro 


Asp 


Val 


Thr 


Arg 






300 










Arg 


Val 


Val 


Thr 


Leu 


Asp 


Ala 




315 










320 


Gly Asp 


Ala 


Pro 


Ala 


Leu 


Arg 


330 










335 




Gin 















file://C:\CrO\Outhold\VsrI75 1 299A.htm 



6/29/01 



Page 4 of 7 



RAW SEQUENCE LISTING DATE : 06/29/2001 

PATENT APPLICATION: US/09/751 , 299A TIME: 14:23:08 

Input Set : A:\DIVER1440-2final.txt 
Output Set: N:\CRF3\06292001\l751299A.raw 



186 


<213> ORGANISM: 


Unknown 


Organism 


















188 

J. KJ Kj 


<220> FEATURE: 




























189 


<223> OTHER 


INFORMATION 


: Description of 


Unknown 


Organism: Obtained from 


190 




environmental 


sample 






















192 

-L -X £m+ 


<220> FEATURE: 




























193 


<221> NAME/KEY: 


CDS 


























194 


<222> LOCATION: 


(1) 


. . (1014) 






















196 


<400> SEQUENCE: 


3 


























197 


ata 


aaa 


gaa 


get 


ate 


aag 


gtc 


gec 


tgc 


ata 


caa 


gee 


acc 


ccg 


ate 


tac 


48 


198 


Met 


Lys 


Glu 


Ala 


He 


Lys 


Val 


Ala 


Cys 


Val 


Gin 


Ala 


Ala 


Pro 


He 


Tyr 




199 

A- *S 


1 








5 










10 










15 






201 

Li V/ JL. 


ata 


gat 


ttg 


aaa 


acq 

3 3 


acg 


gtg 


gac 


aaa 


acc 


att 


aaa 


ttg 


atg 


gaa 


gaa 


96 


202 


Met 


Asd 


Leu 


Glu 

VJ -i. 1-* 


Ala 


Thr 


Val 


Asp 


Lys 


Thr 


He 


Glu 


Leu 


Met 


Glu 


Glu 




203 

V/ 








20 










25 










30 








205 


gca 


gca 


cat 


aat 


aat 


get 


cat 

3 


ctg 


ate 


acc 


ttt 


ccg 


gaa 


act 


taa 

^ 3 3 


att 


144 


206 


Ala 


Ala 


Ara 


A<?n 


Asn 


Ala 


Arg 


Leu 


He 


Ala 


Phe 


Pro 


Glu 


Thr 


Trp 


He 




207 






35 










40 










45 










209 


L> C*l 




t ri C 




taa 

*~ 3 3 


ttt 


ctt 


taa 


ctt 


aac 

y a. 


tea 


cca 


aca 


taa 


aca 

\~4 (—4 


ata 


192 

mlrn- _S ^mm 


210 


Pro 


Gl v 




Prn 

IT J- w 


Tro 


Phe 


Leu 


Trp 


Leu 


Asp 


Ser 


Pro 


Ala 


Trp 

tr 


Ala 


Met 




211 




50 










55 










60 












i j 




ttt 

l»- Nu- 


ata 


rac 


caa 


tac 


cat 


aaa 


aac 


tea 


tta 


aaa 


ttg 


aat 


aac 
yy u 


cct 


240 


214 


Gin 

NwJ J* 11 


Phe 


Val 


Ara 


Gin 


Tyr 


His 


Glu 


Asn 


Ser 


Leu 


Glu 


Leu 


Asp 


Gly 


Pro 




215 

jl* J_ 


65 










70 










75 










80 




217 


caa 


act 


aaa 

U ^4 Vh4 


cac 


att 


tea 


gat 


gca 


gec 


aag 


caa 


tta 


aaa 


ate 


ata 


gtc 


288 


218 

-I- w 


Gin 


Ala 


Lvs 


Ara 


He 


Ser 


Asp 


Ala 


Ala 


Lvs 


Arg 


Leu 


Gly 


He 


Met 


Val 




219 










85 










90 










95 






221 


acc 


eta 


aaa 


ata 


agt 


gaa 


egg 

3 3 


ate 


aat 


aac 
yy° 


acc 


ctt 


tac 


ate 


agt 


cag 


336 


222 


Thr 


Leu 

J-J 


Glv 


Met 


Ser 


Glu 


Arg 


Val 


Gly 


Gly 


Thr 


Leu 


Tyr 


He 


Ser 


Gin 




223 








100 










105 










110 








225 


too 
3 3 


ttc 


ata 


aac 


qat 


aat 


qqt 


gac 


acc 


att 


aaa 


gee 


caa 


cga 


aaa 


ttg 


384 


226 


Trp 


Phe 


He 


Gly 


Asp 


Asn 


Gly 


Asp 


Thr 


He 


Gly 


Ala 


Arg 


Arg 


Lvs 


Leu 




227 






115 










120 










125 










229 


aaa 


cct 


act 


ttt 


qtt 


gaa 


cgt 


act 


ttg 


ttc 


aac 


gaa 


aaa 


gat 


aat 

3 3 s " 


tea 


432 


230 


Lys 


Pro 


Thr 


Phe 


Val 


Glu 


Arg 


Thr 


Leu 


Phe 


Gly 


Glu 


Gly 


Asp 


Gly 


Ser 




231 




130 










135 










140 












233 


teg 


eta 


gcg 


gtt 


ttc 


gag 


acg 


tct 


gtt 


gga 


agg 


ctg 


ggt 


ggc 


tta 


tgc 


480 


234 


Ser 


Leu 


Ala 


Val 


Phe 


Glu 


Thr 


Ser 


Val 


Gly 


Arg 


Leu 


Gly 


Gly 


Leu 


Cys 




235 


145 










150 










155 










160 




237 


tgt 


tgg 


gag 


cac 


ctt 


caa 


ccg 


eta 


aca 


aaa 


tac 


get 


ttg 


tat 


gca 


caa 


528 


238 


Cys 


Trp 


Glu 


His 


Leu 


Gin 


Pro 


Leu 


Thr 


Lys 


Tyr 


Ala 


Leu 


Tyr 


Ala 


Gin 




239 










165 










170 










175 






241 


aat 


gaa 


gag 


att 


cat 


tgt 


gcg 


get 


tgg 


ccg 


age 


ttt 


age 


ctt 


tat 


cct 


576 


242 


Asn 


Glu 


Glu 


He 


His 


Cys 


Ala 


Ala 


Trp 


Pro 


Ser 


Phe 


Ser 


Leu 


Tyr 


Pro 




243 








180 










185 










190 








245 


aat 


gcg 


gcg 


aaa 


gec 


ctg 


ggg 


cct 


gat 


gtc 


aat 


gta 


gcg 


gee 


tct 


cga 


624 


246 


Asn 


Ala 


Ala 


Lys 


Ala 


Leu 


Gly 


Pro 


Asp 


Val 


Asn 


Val 


Ala 


Ala 


Ser 


Arg 




247 






195 










200 










205 










249 


ate 


tat 


gec 


gtt 


gaa 


ggg 


caa 


tgc 


ttc 


gta 


eta 


gcg 


teg 


tgt 


gcg 


etc 


672 


250 


He 


Tyr 


Ala 


Val 


Glu 


Gly 


Gin 


Cys 


Phe 


Val 


Leu 


Ala 


Ser 


Cys 


Ala 


Leu 
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251 




210 

Crf -J- \p/ 










215 










220 












253 


crtt 

^rf- ^ 


tea 


caa 


tec 


at a 


ate 


era t 


a t cr 


ctt 


tat 


a pa 

O CI 


y o i_ 


nap 

y a ^w* 


na a 

y ci a 


a a n 

a a y 


p a i~ 

\-« Cl L. 


720 


254 


Val 


Ser 


Gin 


Ser 


Met 


lie 


Aso 


Met 


Leu 


Cvs 


Thr 


Aso 


Asd 


Glu 


L vs 

-1—1 V w 


His 




255 


225 

b fari s> 










230 

fc— ^ V/ 










235 










240 

i-, T W 




257 


y v_y 


tta 

i_ u y 


ctt 




net 




yy L 


rrna 
y y a 


pap 

CI 


1— \— a 


prri* 


a 1" p 


a t" a 


y y y 


p p +* 


p^ "h 
y ci i_ 


7 68 


258 


Ala 


Leu 


Leu 


Leu 


Ala 


Glv 

\_i -i_ 


Glv 


Glv 


His 


Ser 


Arg 


He 


He 


Glv 


Pro 


Asn 




259 










245 

Chi 4 V-* 










250. 










255 

'J -J 






261 


not 




crap 

>-H C* V-f 


tta 


crt p 


y °y 


ppt* 


ctt 


npp 

y ^ v — • 


\-4 d d 


a a t 


rra a 


na n 
y ay 


y y 


att 


p t" p 


816 


262 


Glv 


Gly 


Asp 


Leu 


Val 


Ala 


Pro 


Leu 


Ala 


Glu 


Asn 


Glu 


Glu 


Glv 


He 

^hU- >w 


Leu 




263 








260 

b w vy 










265 










270 








265 


fan 

L, CI 


u La 


CI Ci >w/ 




y d u 




y y d 




r* n c 




ctt 


get 


3 3 ^5 
aaa 


a T_g 




n c T\ 
y La 


OUT 


266 


Tvr 


Ala 

xaX CI 


A^n 


T.pii 
-LJ ~ u 




Pro 


G3 v 

vj _l _y 


V CI -I. 


Arrr 
^ix y 


x x c 


Leu 


Ala 






Al a 

n x a 


Ala 




267 






275 










280 










285 










269 


gac 


cct 


get 


ggt 


cat . 


tat 


tec 


cgt 


ccc 


gac 


att 


act 


cgc 


ttg 


eta 


ata 


912 


270 


Asp 


Pro 


Ala 


Gly 


His 


Tyr 


Ser 


Arg 


Pro 


Asp 


He 


Thr 


Arg 


Leu 


Leu 


He 




271 




290 










295 










300 












273 


gat 


cgc 


age 


cct 


aaa 


tta 


ccg 


gta 


gtt 


gaa 


att 


gaa 


ggt 


gat 


ctt 


cgt 


960 


274 


Asp 


Arg 


Ser 


Pro 


Lys 


Leu 


Pro 


Val 


Val 


Glu 


He 


Glu 


Gly 


Asp 


Leu 


Arg 




275 


305 










310 










315 










320 




277 


cct 


tac 


get 


ttg 


ggt 


aaa 


gcg 


tct 


gag 


acg 


ggt 


gcg 


caa 


etc 


gaa 


gaa 


1008 


278 


Pro 


Tyr 


Ala 


Leu 


Gly 


Lys 


Ala 


Ser 


Glu 


Thr 


Gly Ala 


Gin 


Leu 


Glu 


Glu 




279 










325 










330 










335 






281 


att 


tga 






























1014 



282 He 

285 <210> SEQ ID NO: 4 

286 <211> LENGTH: 337 

287 <212> TYPE: PRT 

288 <213> ORGANISM: Unknown Organism 
W — > 289 <220> FEATURE: 

28 9 <223> OTHER INFORMATION: Description of Unknown Organism: Obtained from 
292 <400> SEQUENCE: 4 



293 


Met 


Lys 


Glu 


Ala 


He 


Lys 


Val 


Ala 


Cys 


Val 


Gin 


Ala 


Ala 


Pro 


He 


Tyr 


294 


1 








5 










10 










15 




295 


Met 


Asp 


Leu 


Glu 


Ala 


Thr 


Val 


Asp 


Lys 


Thr 


He 


Glu 


Leu 


Met 


Glu 


Glu 


296 








20 










25 










30 






297 


Ala 


Ala 


Arg 


Asn 


Asn 


Ala 


Arg 


Leu 


He 


Ala 


Phe 


Pro 


Glu 


Thr 


Trp 


He 


298 






35 










40 










45 








299 


Pro 


Gly 


Tyr 


Pro 


Trp 


Phe 


Leu 


Trp 


Leu 


Asp 


Ser 


Pro 


Ala 


Trp 


Ala 


Met 


300 




50 










55 










60 










301 


Gin 


Phe 


Val 


Arg 


Gin 


Tyr 


His 


Glu 


Asn 


Ser 


Leu 


Glu 


Leu 


Asp 


Gly 


Pro 


302 


65 










70 










75 










80 


303 


Gin 


Ala 


Lys 


Arg 


He 


Ser 


Asp 


Ala 


Ala 


Lys 


Arg 


Leu 


Gly 


He 


Met 


Val 


304 










85 










90 










95 




305 


Thr 


Leu 


Gly 


Met 


Ser 


Glu 


Arg 


Val 


Gly 


Gly 


Thr 


Leu 


Tyr 


He 


Ser 


Gin 


306 








100 










105 










110 






307 


Trp 


Phe 


He 


Gly 


Asp 


Asn 


Gly 


Asp 


Thr 


He 


Gly 


Ala 


Arg 


Arg 


Lys 


Leu 


308 






115 










120 










125 








309 


Lys 


Pro 


Thr 


Phe 


Val 


Glu 


Arg 


Thr 


Leu 


Phe 


Gly 


Glu 


Gly 


Asp 


Gly 


Ser 



310 130 135 140 
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3 
4 
5 
7 
8 



<110> 



<120> 



APPLICANT: Madden, Mark 
Weiner, David P. 
Chaplin, Jennifer A. 

TITLE OF INVENTION: METHODS FOR PRODUCING 
ALPHA-SUBSTITUTED CARBOXYLIC ACIDS 



ENANTIOMERICALLY PURE 



10 <130> FILE REFERENCE: DIVER1440-2 
12 <140> CURRENT APPLICATION NUMBER: US 09/751,299 
C— > 13 <141> CURRENT FILING DATE: 2001-05-01 

15 <150> PRIOR APPLICATION NUMBER: 60/254,414 

16 <151> PRIOR FILING DATE: 2000-12-07 

18 <150> PRIOR APPLICATION NUMBER: 60/173,609 

19 <151> PRIOR FILING DATE: 1999-12-29 
21 <160> NUMBER OF SEQ ID NOS : 4 

23 <170> SOFTWARE: Patent In Ver. 2.1 

25 <210> SEQ ID NO: 1 

26 <211> LENGTH: 1041 

27 <212> TYPE: DNA 

28 <213> ORGANISM: Unknown Organism 

30 <220> FEATURE: 

31 <223> OTHER INFORMATION: Description of Unknown Organism: Obtained from an 

32 environmental sample 

34 <220> FEATURE: 

35 <221> NAME /KEY : CDS 

36 <222> LOCATION: (1)..(1041) 



38 


<400> SEQUENCE: 


1 


























39 


atg 


teg 


gag 


ccc 


atg 


acg 


aag 


tat 


cgc 


ggc 


gcg 


gcg 


gtg 


cag 


gee 


gcg 


48 


40 


Met 


Ser 


Glu 


Pro 


Met 


Thr 


Lys 


Tyr 


Arg 


Gly 


Ala 


Ala 


Val 


Gin 


Ala 


Ala 




41 


1 








5 










10 










15 






43 


ccg 


gtg 


ttc 


etc 


gat 


etc 


gac 


cgc 


aca 


gtc 


gag 


aaa 


gcg 


ate 


ggc 


ctg 


96 


44 


Pro 


Val 


Phe 


Leu 


Asp 


Leu 


Asp 


Arg 


Thr 


Val 


Glu 


Lys 


Ala 


He 


Gly 


Leu 




45 








20 










25 










30 








47 


ate 


gag 


cag 


gcg 


gec 


aag 


cag 


gac 


gtg 


cgc 


ctg 


ate 


gca 


ttc 


cca 


gag 


144 


48 


He 


Glu 


Gin 


Ala 


Ala 


Lys 


Gin 


Asp 


Val 


Arg 


Leu 


He 


Ala 


Phe 


Pro 


Glu 




49 






35 










40 










45 










51 


act 


tgg 


att 


ccc 


ggc 


tat 


ccc 


ttt 


tgg 


ata 


tgg 


ctg 


ggc 


gcg 


ccg 


get 


192 


52 


Thr 


Trp 


He 


Pro 


Gly 


Tyr 


Pro 


Phe 


Trp 


He 


Trp 


Leu 


Gly 


Ala 


Pro 


Ala 




53 




50 










55 










60 












55 


tgg 


ggc 


atg 


cgc 


ttc 


gtc 


cag 


cgc 


tat 


ttc 


gag 


aat 


teg 


etc 


gtg 


cgc 


240 


56 


Trp 


Gly 


Met 


Arg 


Phe 


Val 


Gin 


Arg 


Tyr 


Phe 


Glu 


Asn 


Ser 


Leu 


Val 


Arg 




57 


65 










70 










75 










80 




59 


ggc 


age 


aag 


cag 


tgg 


cag 


gec 


ctg 


gcg 


gat 


gcg 


gec 


cgc 


cgc 


cac 


ggc 


288 


60 


Gly 


Ser 


Lys 


Gin 


Trp 


Gin 


Ala 


Leu 


Ala 


Asp 


Ala 


Ala 


Arg 


Arg 


His 


Gly 




61 










85 










90 










95 






63 


atg 


cat 


gtc 


gtg 


gec 


ggc 


tat 


age 


gag 


cgc 


gcg 


ggc 


ggc 


age 


etc 


tat 


336 


64 


Met 


His 


Val 


Val 


Ala 


Gly 


Tyr 


Ser 


Glu 


Arg 


Ala 


Gly 


Gly 


Ser 


Leu 


Tyr 




65 








100 










105 










110 









67 atg ggc cag gcg ate ttc ggc ccc gat ggc gat ctg ate gec gcg cgc 384 
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68 


Met 


Gly 


Gin 


Ala 


He 


Phe 


Gly 


Pro 


Asp 


Gly 


Asp 


Leu 


He 


Ala 


Ala 


Arg 


69 






115 










120 










125 








71 


cgc 


aaq 


etc 


aag 


cct 


acc 


cat 


gcg 

3 3 


gag 


cgc 


acc 


gtg 


ttc 


ggc 

3 3 


qaq 

3 3 


qqa 

3 3 


72 


Arg 


Lys 


Leu 


Lys 


Pro 


Thr 


His 


Ala 


Glu 


Arg 


Thr 


Val 


Phe 


Gly 


Glu 


Gly 


73 




130 










135 










140 










75 


gac 


qqc 

33 


aqc 


cat 


etc 


qcg 

3 3 


gtg 

3 3 


cac 


gat 


acc 


gee 

3 


ate 


qqq 

3 3 3 


cqc 


etc 


qqc 


76 


Asp 


Gly 


Ser 


His 


Leu 


Ala 


Val 


His 


Asp 


Thr 


Ala 


He 


Gly 


Arg 


Leu 


Gly 


77 


145 










150 










155 










160 


79 


acq 

3 3 


etc 


tgt 


tqc 


tgq 

3 3 


qaq 

3 3 


cac 


ate 


cag 


cca 


ttg 


teg 


aaa 


tac 


qcc 


atg 


80 


Ala 


Leu 


Cys 


Cys 


Trp 


Glu 


His 


He 


Gin 


Pro 


Leu 


Ser 


Lys 


Tyr 


Ala 


Met 


81 










165 










170 










175 




83 


tac 


qcc 


qcc 


gac 


gaa 


cag 


qtc 


cac 


qtc 

3 


qcq 

3 3 


tcq 


tgg 

3 3 


ccq 


age 


ttc 


age 


84 


Tyr 


Ala 


Ala 


Asp 


Glu 


Gin 


Val 


His 


Val 


Ala 


Ser 


Trp 


Pro 


Ser 


Phe 


Ser 


85 








180 










185 










190 






87 


etc 


tat 


cgc 


ggc 

3 3 


atg 


gee 


tat 


gcg 

3 3 


etc 


gga 

3 3 


ccg 


gag 

3 3 


gtc 

3 


aat 


acc 


qcc 


88 


Leu 


Tyr 


Arg 


Gly 


Met 


Ala 


Tyr 


Ala 


Leu 


Gly 


Pro 


Glu 


Val 


Asn 


Thr 


Ala 


89 






195 










200 










205 








91 


qca 


age 


cag 


ate 


tac 


gcg 


qtc 

3 


qaq 

3 3 


qqc 

3 3 


qqc 

3 3 


tqc 


tac 


qtq 

3 3 


ctq 


qcq 

3 3 


tcq 


92 


Ala 


Ser 


Gin 


lie 


Tyr 


Ala 


Val 


Glu 


Gly 


Gly 


Cys 


Tyr 


Val 


Leu 


Ala 


Ser 


93 




210 










215 










220 










95 


tgc 


gcg 

33 


acc 


gtt 


teg 


ccg 


Qag 

3 3 


atg 


ate 


aag 


gta 


ttg 


gtg 


gat 

3 


acg 


ccc 


96 


Cys 


Ala 


Thr 


Val 


Ser 


Pro 


Glu 


Met 


He 


Lys 


Val 


Leu 


Val 


Asp 


Thr 


Pro 


97 


225 










230 










235 










240 


99 


gac 


aag 


qaq 

3 3 


atg 


ttc 


etc 


aag 


gee 


ggc 

3 3 


ggc 

3 3 


ggt 

3 3 


ttt 


gec 


atg 


att 


ttc 


100 


Asp 


Lys 


Glu 


Met 


Phe 


Leu 


Lys 


Ala 


Gly 


Gly 


Gly 


Phe 


Ala 


Met 


He 


Phe 


101 










245 










250 










255 




103 


ggg 

-J 


ccc 


gac 


ggc 


cgc 


gee 


ctg 


gee 


gag 


ccg 


etc 


ccg 


gag 

3 3 


acc 


gaa 


gag 

3 3 


104 


Gly 


Pro 


Asp 

A. 


Gly Arg Ala 


Leu 


Ala 


Glu 


Pro 


Leu 


Pro 


Glu 


Thr 


Glu 


Glu 


105 








260 










265 










270 






107 


gga 
3 -j 


ctg 


ctg 


gtc 


gec 


gat 


ate 


gac 


etc 


qqc 

3 3 


atg 


ate 


qcq 

3 3 


ttg 


qcc 


aag 


108 


Gly 


Leu 


Leu 


Val 


Ala 


Asp 


He 


Asp 


Leu 


Gly 


Met 


He 


Ala 


Leu 


Ala 


Lys 


109 






275 










280 










285 








111 


acq 

3 3 


acq 


qcc 


gat 


ccg 


gcg 


qqc 


cac 


tat 


tea 


egg 


ccc 


gac 


gta 


acq 


caa 


112 


Ala 


Ala 


Ala 


Asp 


Pro 


Ala 


Gly 


His 


Tyr 


Ser 


Arg 


Pro 


Asp 


Val 


Thr 


Arg 


113 




290 










295 










300 










115 


ctg 


ctg 


ctg 


gat 


cga 


cgt 


ccg 


gee 


caa 


cgc 


gtc 


gtc 


acg 


ctt 


gat 


gec 


116 


Leu 


Leu 


Leu 


Asp 


Arg 


Arg 


Pro 


Ala 


Gin 


Arg 


Val 


Val 


Thr 


Leu 


Asp 


Ala 




305 










310 










315 










■J w 


119 


gca 


ttc 


gaa 


ccg 


caa 


aac 


gag 


gac 


aag 


ggc 


gac 


gcg 


ccc 


gcg 


ctg 


cgc 


120 


Ala 


Phe 


Glu 


Pro 


Gin 


Asn 


Glu 


Asp 


Lys 


Gly 


Asp 


Ala 


Pro 


Ala 


Leu 


Arg 


121 










325 










330 










335 




123 


gtg 


gtg 


gcg 


gaa 


age 


gee 


gee 


gee 


gcg 


cag 


tag 












124 


Val 


Val 


Ala 


Glu 


Ser 


Ala 


Ala 


Ala 


Ala 


Gin 












125 








340 










345 
















128 


<210> SEQ ID NO 


: 2 
























129 


<211> LENGTH: 346 
























130 


<212> TYPE: 


PRT 


























131 




^-ORGANISM: 


Unknown 


Organism 
















W— > 132 


V<220> FEATURE: 



























432 



480 



528 



576 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1041 
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132 


<223> OTHER 


INFORMATION 


: Description of 


Unknown 


Organism><pbtai 


135 


<400> SEQUENCE: 


2 




















— 




136 


Met 


Ser 


Glu 


Pro 


Met 


Thr 


Lys 


Tyr 


Arg 


Gly Ala 


Ala 


Val 


Gin 


Ala 


Ala 


137 


1 








5 










10 










15 




138 


Pro 


Val 


Phe 


Leu 


Asp 


Leu Asp Arg 


Thr 


Val 


Glu 


Lys 


Ala 


He 


Gly 


Leu 


139 








20 










25 










30 






140 


He 


Glu 


Gin 


Ala 


Ala 


Lys 


Gin 


Asp 


Val 


Arg 


Leu 


lie 


Ala 


Phe 


Pro 


Glu 


141 






35 










40 










45 








142 


Thr 


Trp 


He 


Pro 


Gly 


Tyr 


Pro 


Phe 


Trp 


He 


Trp 


Leu 


Gly Ala 


Pro 


Ala 


143 




50 










55 










60 










144 


Trp 


Gly 


Met 


Arg 


Phe 


Val 


Gin 


Arg 


Tyr 


Phe 


Glu 


Asn 


Ser 


Leu 


Val 


Arg 


145 


65 










70 










75 










80 


146 


Gly 


Ser 


Lys 


Gin 


Trp 


Gin 


Ala 


Leu 


Ala 


Asp 


Ala 


Ala 


Arg 


Arg 


His 


Gly 


147 










85 










90 










95 




148 


Met 


His 


Val 


Val 


Ala 


Gly 


Tyr 


Ser 


Glu 


Arg 


Ala 


Gly 


Gly 


Ser 


Leu 


Tyr 


149 








100 










105 










110 






150 


Met 


Gly 


Gin 


Ala 


He 


Phe 


Gly 


Pro 


Asp 


Gly Asp 


Leu 


He 


Ala 


Ala 


Arg 


151 






115 










120 










125 








152 


Arg 


Lys 


Leu 


Lys 


Pro 


Thr 


His 


Ala 


Glu 


Arg 


Thr 


Val 


Phe 


Gly 


Glu 


Gly 


153 




130 










135 










140 










154 


Asp 


Gly 


Ser 


His 


Leu 


Ala 


Val 


His 


Asp 


Thr 


Ala 


He 


Gly Arg 


Leu 


Gly 


155 


145 










150 










155 










160 


156 


Ala 


Leu 


Cys 


Cys 


Trp 


Glu 


His 


He 


Gin 


Pro 


Leu 


Ser 


Lys 


Tyr 


Ala 


Met 


157 










165 










170 










175 




158 


Tyr 


Ala 


Ala 


Asp 


Glu 


Gin 


Val 


His 


Val 


Ala 


Ser 


Trp 


Pro 


Ser 


Phe 


Ser 


159 








180 










185 










190 






160 


Leu 


Tyr 


Arg 


Gly 


Met 


Ala 


Tyr 


Ala 


Leu 


Gly 


Pro 


Glu 


Val 


Asn 


Thr 


Ala 


161 






195 










200 










205 








162 


Ala 


Ser 


Gin 


He 


Tyr 


Ala 


Val 


Glu 


Gly 


Gly 


Cys 


Tyr 


Val 


Leu 


Ala 


Ser 


163 




210 










215 










220 










164 


Cys 


Ala 


Thr 


Val 


Ser 


Pro 


Glu 


Met 


He 


Lys 


Val 


Leu 


Val 


Asp 


Thr 


Pro 


165 


225 










230 










235 










240 


166 


Asp 


Lys 


Glu 


Met 


Phe 


Leu 


Lys 


Ala 


Gly 


Gly Gly 


Phe 


Ala 


Met 


He 


Phe 


167 










245 










250 










255 




168 


Gly 


Pro 


Asp 


Gly Arg 


Ala 


Leu 


Ala 


Glu 


Pro 


Leu 


Pro 


Glu 


Thr 


Glu 


Glu 


169 








260 










265 










270 






170 


Gly 


Leu 


Leu 


Val 


Ala 


Asp 


He 


Asp 


Leu 


Gly Met 


He 


Ala 


Leu 


Ala 


Lys 


171 






275 










280 










285 








172 


Ala 


Ala 


Ala 


Asp 


Pro 


Ala 


Gly 


His 


Tyr 


Ser 


Arg 


Pro 


Asp 


Val 


Thr 


Arg 


173 




290 










295 










300 










174 


Leu 


Leu 


Leu 


Asp 


Arg 


Arg 


Pro 


Ala 


Gin 


Arg 


Val 


Val 


Thr 


Leu 


Asp 


Ala 


175 


305 










310 










315 










320 


176 


Ala 


Phe 


Glu 


Pro 


Gin 


Asn 


Glu 


Asp 


Lys 


Gly Asp 


Ala 


Pro 


Ala 


Leu 


Arg 


177 










325 










330 










335 




178 


Val 


Val 


Ala 


Glu 


Ser 


Ala 


Ala 


Ala 


Ala 


Gin 














179 








340 










345 
















183 
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184 
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185 
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186 <213> ORGANISM: Unknown Organism 

188 <220> FEATURE: 

189 <223> OTHER INFORMATION: Description of 

190 environmental sample 
192 <220> FEATURE: 



Unknown Organism: Obtained from an 



193 


<221> NAME/KEY: 


CDS 


























194 


<222> LOCATION: 


(1) 


. . (1014) 






















196 


<4 00> SEQUENCE: 


3 


























197 


/atg 


aaa 


gaa 


get 


ate 


aag 


gtc 


gee 


tgc 


gtg 


caa 


gee 


gee 


ccg 


ate 


tac 


48 


198 


Met 


Lys 


Glu 


Ala 


He 


Lys 


Val 


Ala 


Cys 


Val 


Gin 


Ala 


Ala 


Pro 


He 


Tyr 




199 


1 








5 










10 










15 






201 


atg 


gat 


ttg 


gag 


gcg 

7 ZJ 


acg 


gtg 


gac 


aaa 


acc 


att 


gaq 


ttg 


atg 


qaa 


gaa 


96 


202 


Met 


Asp 


Leu 


Glu 


Ala 


Thr 


Val 


Asp 


Lys 


Thr 


He 


Glu 


Leu 


Met 


Glu 


Glu 




203 








20 










25 










30 








205 


gca 


gca 


cgt 


aat 


aat 


get 


cgt 


ctg 

zj 


ate 


gee 

ZJ 


ttt 


ccg 


gaa 


act 


tgg 

7 ZD 


att 


144 


206 


Ala 


Ala 


Arg 


Asn 


Asn 


Ala 


Arg 


Leu 


He 


Ala 


Phe 


Pro 


Glu 


Thr 


Trp 


He 




207 






35 










40 










45 










209 


cca 


ggc 


tac 


cca 


tgg 

7 7 


ttt 


ctt 


tgg 

7 7 


ctt 


gac 


tea 


cca 


gca 


tqq 

7 7 


gca 


atg 


192 


210 


Pro 


Gly 


Tyr 


Pro 


Trp 

It 


Phe 


Leu 


Trp 


Leu 


Asp 


Ser 


Pro 


Ala 


Trp 


Ala 


Met 




211 




50 










55 










60 












213 


caa 


ttt 


gta 


cgc 


caa 


tac 


cat 


gag 
7 — ' 


aac 


tea 


ttg 


gag 

7 7 


ttg 


gat 

ZJ 


ggc 

7 7 


cct 


240 


214 


Gin 


Phe 


Val 


Arg 


Gin 


Tyr 


His 


Glu 


Asn 


Ser 


Leu 


Glu 


Leu 


Asp 


Gly 


Pro 




215 


65 










70 










75 










80 




217 


caa 


get 


aag 


cgc 


att 


tea 


gat 


gca 


gee 

ZJ 


aag 

ZJ 


egg 

7 7 


ttg 


gga 

7 7 


ate 


atg 


gtc 


288 


218 


Gin 


Ala 


Lys 


Arg 


He 


Ser 


Asp 


Ala 


Ala 


Lys 


Arg 


Leu 


Gly 


He 


Met 


Val 




219 










85 










90 










95 






221 


acc 


ctg 


ggg 


atg 


agt 


gaa 


egg 


gtc 

ZJ 


ggt 

ZJ ZJ 


ggc 

ZJ ZJ 


acc 


ctt 


tac 


ate 


agt 


cag 


336 


222 


Thr 


Leu 


Gly 


Met 


Ser 


Glu 


Arg 


Val 


Gly 


Gly 


Thr 


Leu 


Tyr 


He 


Ser 


Gin 




223 








100 










105 










110 








225 


tgg 


ttc 


ata 


ggc 


gat 

ZJ 


aat 


ggt 


gac 
7 


acc 


att 


ggg 
777 


gee 


egg 

7 7 


cga 


aag 


ttg 


384 


226 


Trp 


Phe 


He 


Gly 


Asp 


Asn 


Gly 


Asp 


Thr 


He 


Gly 


Ala 


Arg 


Arg 


Lys 


Leu 




227 






115 










120 










125 










229 


aaa 


cct 


act 


ttt 


gtt 


gaa 


cgt 


act 


ttg 


ttc 


ggc 


gaa 


ggg 


gat 


ggt 


tea 


432 


230 


Lys 


Pro 


Thr 


Phe 


Val 


Glu 


Arg 


Thr 


Leu 


Phe 


Gly 


Glu 


Gly 


Asp 


Gly 


Ser 




231 




130 










135 










140 












233 


teg 


eta 


gcg 


gtt 


ttc 


gag 


acg 


tct 


gtt 


gga 


agg 


ctg 


ggt 


ggc 


tta 


tgc 


480 


234 


Ser 


Leu 


Ala 


Val 


Phe 


Glu 


Thr 


Ser 


Val 


Gly 


Arg 


Leu 


Gly 


Gly 


Leu 


Cys 




235 


145 










150 










155 










160 




237 


tgt 


tgg 


gag 


cac 


ctt 


caa 


ccg 


eta 


aca 


aaa 


tac 


get 


ttg 


tat 


gca 


caa 


528 


238 


Cys 


Trp 


Glu 


His 


Leu 


Gin 


Pro 


Leu 


Thr 


Lys 


Tyr 


Ala 


Leu 


Tyr 


Ala 


Gin 




239 










165 










170 










175 






241 


aat 


gaa 


gag 


att 


cat 


tgt 


gcg 


get 


tgg 


ccg 


age 


ttt 


age 


ctt 


tat 


cct 


576 


242 


Asn 


Glu 


Glu 


He 


His 


Cys 


Ala 


Ala 


Trp 


Pro 


Ser 


Phe 


Ser 


Leu 


Tyr 


Pro 




243 








180 










185 










190 








245 


aat 


gcg 


gcg 


aaa 


gec 


ctg 


ggg 


cct 


gat 


gtc 


aat 


gta 


gcg 


gee 


tct 


cga 


624 


246 


Asn 


Ala 


Ala 


Lys 


Ala 


Leu 


Gly 


Pro 


Asp 


Val 


Asn 


Val 


Ala 


Ala 


Ser 


Arg 




247 






195 










200 










205 










249 


ate 


tat 


gee 


gtt 


gaa 


ggg 


caa 


tgc 


ttc 


gta 


eta 


gcg 


teg 


tgt 


gcg 


etc 


672 


250 


He 


Tyr 


Ala 


Val 


Glu 


Gly 


Gin 


Cys 


Phe 


Val 


Leu 


Ala 


Ser 


Cys 


Ala 


Leu 
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251 
253 
254 
255 
257 
258 
259 
261 
262 
263 
265 
266 
267 
269 
270 
271 
273 
274 
275 
277 
278 
279 
281 
282 
285 
286 
287 
28 
28' 
289 
292 
293 
294 
295 
296 
297 
298 
299 
300 
301 
302 
303 
304 
305 
306 
307 
308 
309 
310 





210 










215 










220 












gtt 


tea 


caa 


tec 


atg 


ate 


gat 


atg 


ctt 


tgt 


aca 


gat 


gac 


gaa 


aag 


cat 


720 


Val 


Ser 


Gin 


Ser 


Met 


He 


Asp 


Met 


Leu 


Cys 


Thr 


Asp 


Asp 


Glu 


Lys 


His 




225 










230 










235 










240 




acq 

ZJ ZJ 


ttg 

ZJ 


ctt 


ctg 


get 


ggt 


ggt 


gga 


cac 


tea 


cgt 


ate 


ata 


QQQ 

ZJ ZJ ZJ 


cct 


gat 


768 


Ala 


Leu 


Leu 


Leu 


Ala 


Gly 


Gly 


Gly 


His 


Ser 


Arg 


He 


He 


Gly 


Pro 


Asp 












245 










250 










255 






ggt 


ggt 


gac 


ttg 


gtc 

— 


gcg 


cct 


ctt 


gee 


gaa 


aat 


gaa 


gag 


ggt 


att 


etc 


816 


Gly 


Gly 


Asp 


Leu 


Val 


Ala 


Pro 


Leu 


Ala 


Glu 


Asn 


Glu 


Glu 


Gly 


He 


Leu 










260 










265 










270 








tac 


\*A \*L 


aac 

\-A- «W 


ctt 


aat 


cct 




at a 


cac 

V»f M V-f 


ate 


ctt 


get 


aaa 

\JL \*JL \JL 


ata 


aca 


\*a \**t a 


864 

U U T 


Tyr 


Ala 


Asn 


Leu 


Asp 


Pro 


Gly 


Val 


Arg 


He 


Leu 


Ala 


Lys 


Met 


Ala 


Ala 








275 










280 










285 










gac 


cct 


get 


ggt 


cat 


tat 


tec 


cgt 


ccc 


gac 


att 


act 


cgc 


ttg 


eta 


ata 


912 


Asp 


Pro 


Ala 


Gly 


His 


Tyr 


Ser 


Arg 


Pro 


Asp 


He 


Thr 


Arg 


Leu 


Leu 


He 






290 










295 










300 












gat 


cgc 


age 


cct 


aaa 


tta 


ccg 


gta 


gtt 


gaa 


att 


gaa 


ggt 


gat 


ctt 


cgt 


960 


Asp 


Arg 


Ser 


Pro 


Lys 


Leu 


Pro 


Val 


Val 


Glu 


He 


Glu 


Gly 


Asp 


Leu 


Arg 




305 










310 










315 










320 




cct 


tac 


get 


ttg 


ggt 


aaa 


gcg 


tct 


gag 


acg 


ggt 


gcg 


caa 


etc 


gaa 


gaa 


1008 


Pro 


Tyr 


Ala 


Leu 


Gly 


Lys 


Ala 


Ser 


Glu 


Thr 


Gly Ala 


Gin 


Leu 


Glu 


Glu 












325 










330 










335 







tga 



att 
He 
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Description of Unknown Organism: 




Met 


Lys 


Glu 


Ala 


He 


Lys 


Val 


Ala 


Cys 


Val 


Gin 


Ala 


Ala 


Pro 


He 


Tyr 


1 








5 










10 










15 




Met 


Asp 


Leu 


Glu 


Ala 


Thr 


Val 


Asp 


Lys 


Thr 


He 


Glu 


Leu 


Met 


Glu 


Glu 








20 










25 










30 






Ala 


Ala 


Arg 


Asn 


Asn 


Ala 


Arg 


Leu 


He 


Ala 


Phe 


Pro 


Glu 


Thr 


Trp 


He 






35 










40 










45 








Pro 


Gly 


Tyr 


Pro 


Trp 


Phe 


Leu 


Trp 


Leu 


Asp 


Ser 


Pro 


Ala 


Trp 


Ala 


Met 




50 










55 










60 










Gin 


Phe 


Val 


Arg 


Gin 


Tyr 


His 


Glu 


Asn 


Ser 


Leu 


Glu 


Leu 


Asp 


Gly 


Pro 


65 










70 










75 










80 


Gin 


Ala 


Lys 


Arg 


He 


Ser 


Asp 


Ala 


Ala 


Lys 


Arg 


Leu 


Gly 


He 


Met 


Val 










85 










90 










95 




Thr 


Leu 


Gly 


Met 


Ser 


Glu 


Arg 


Val 


Gly 


Gly 


Thr 


Leu 


Tyr 


He 


Ser 


Gin 








100 










105 










110 






Trp 


Phe 


He 


Gly 


Asp 


Asn 


Gly 


Asp 


Thr 


He 


Gly Ala 


Arg 


Arg 


Lys 


Leu 






115 










120 










125 








Lys 


Pro 


Thr 


Phe 


Val 


Glu 


Arg 


Thr 


Leu 


Phe 


Gly 


Glu 


Gly Asp 


Gly 


Ser 




130 










135 










140 
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